Dynamic Frequency Management in 802.11-based 
Multi-Radio Wireless Networks 



George Afhanasiou 3 , Leandros Tassiulas b 



"Automatic Control Lab, School of Electrical Engineering, 
KTH Royal Institue of Technology, Sweden. E-mail: georgioa@kth.se 
b Department of Computers and Communications Engineering, 
University of Thessaly, Greece. E-mail: leandros@uth.gr 



Abstract 

Efficient channel selection is essential in 802. 1 1 mesh deployments, for minimizing contention and interference among 
co-channel devices and thereby supporting a plurality of QoS-sensitive applications. A few protocols have been 
proposed for frequency allocation in such networks, however they do not address the problem end-to-end. In this 
paper, we present a general formulation of the channel selection problem taking into account the performance of 
both mesh-access and mesh-backhaul. Moreover, we propose ARACHNE, a routing-aware channel selection protocol 
for wireless mesh networks. ARACHNE is distributed in nature, and motivated by our measurements on a wireless 
testbed. The main novelty of our protocol comes from adopting a metric that captures the end-to-end link loads across 
different routes in the network. ARACHNE prioritizes the assignment of low-interference channels to links that (a) 
need to serve high-load aggregate traffic and/or (b) already suffer significant levels of contention and interference. 
Our protocol takes into account the number of potential interfaces (radios) per device, and allocates these interfaces 
in a manner that efficiently utilizes the available channel capacity. We evaluate ARACHNE through extensive, trace- 
driven simulations and we show that approaches the optimal channel selection. We observe that our protocol improves 
the total network throughput, as compared to three other representative channel allocation approaches in literature. 

Keywords: Frequency management, Wireless networks, IEEE 802.1 1, Cross-layer 



1. Introduction 

Wireless mesh networking has been touted as the new 
technology that can support ubiquitous end-to-end con- 
nectivity. In wireless mesh networks, information has 
to be routed via multiple wireless hops before it can 
reach the destination HI 01 ■ A critical requirement for 
the efficient routing of packets is the identification and 
use of interference-limited wireless links. Therefore, 
intermediate mesh hops along a route need to operate 
in frequencies, where contention and interference are 
as low as possible, especially in highly-dense mesh de- 
ployments. We ask the question: How can we allocate 
frequencies in a mesh network, in order to maximize the 
total network throughput, in a distributed manner? 

In order to efficiently allocate the set of available 
channels to nodes, the load at each individual link needs 
to be taken into account. Here, the load at the mesh 
access and backhaul levels could be represented in var- 
ious ways potentially involving the number of 



neighbor transmitters, the traffic demand, the amount of 
traffic flowing through each node, etc., as we discuss 
later. Previous studies on frequency selection, however, 
do not consider the end-to-end load distribution across 
entire routes; they consider the sub-problems of either 
the access level, between clients and access points (APs) 
flH], or the backbone portion of the network |0,Q,[ll- 
We argue that frequency selection algorithms should 
prioritize the assignment of low-interference channels 
at highly-loaded mesh links, both at the access and the 
backhaul levels. As a simple example, consider the con- 
nectivity graph of Fig. [TJ where nodes A, B and C gen- 
erate equal amounts of traffic towards node E, while the 
same channel is initially used by all links. 

In this scenario, the DE link facilitates the aggre- 
gated traffic, destined to node E. Thus, DE should 
be assigned a frequency such that the aggregate traf- 
fic towards E is efficiently forwarded, i.e., the bottle- 
neck situation with DE, due to potentially low SINR 
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Figure 1 : Load-aware channel selection. 



or high contention levels, is avoided to the extent pos- 
sible. Note here that the frequency selection outcome 
will likely affect the decision of load-aware routing pro- 
tocols, such as RM-AODV (Radio Metric Ad Hoc On- 
Demand Distance Vector Routing) |@]. Hence, both the 
frequency selection and load-aware routing functionali- 
ties are inter-dependent and must be considered in con- 
junction. 

In this paper we present a general formulation of 
the channel allocation problem, taking into account the 
important parameters (i.e., interference level, packet 
dropping and transmission rate) that affect the end-to- 
end throughput in the network. Moreover, we propose 
ARACHNE, a load and routing aware channel selection 
protocol for wireless mesh networks. ARACHNE per- 
forms end-to-end channel selection along a route, by 
adopting a variation of a load characterization metric 
13]. Given that the load-aware routing choices are af- 
fected by the frequency selection policy, ARACHNE 
combines frequency selection and route selection under 
the same unified framework and approaches the max- 
imum throughput in the network. To the best of our 
knowledge, this is one of the first works that present 
a frequency selection protocol across entire routes, in- 
volving both the access and the backhaul levels in con- 
junction with the load-aware routing of information 
between end-hosts. We evaluate ARACHNE through 
trace-driven simulations, using the OPNET simula- 
tion platform. We observe that ARACHNE outperforms 
other channel allocation mechanisms for mesh networks 
in terms of overall network throughput, average delay 
and dropped data. 

The rest of this paper is structured as follows. In 
section [2] we present our preliminary experiments and 
discuss the previous approaches related to our work, 
which motivate our problem formulation and the de- 
sign of ARACHNE. Section [3] presents the modelling 
of channel allocation in wireless mesh networks and 
describes the airtime metric that we employ, based on 
which ARACHNE discovers the channel with the maxi- 
mum long-term throughput. In section|4j we describe in 
detail the design of ARACHNE for both the access and 
the backhaul levels. We evaluate our protocol through 
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Figure 2: The testbed deployment in the 4th (left) and 
the 5th (right) floor of our building. 

simulations, in section [5] Finally, section [6] concludes 
this paper. 

2. Motivating our Channel Allocation Policy 

In this section, we describe a set of preliminary ex- 
periments on our wireless testbed, which motivate the 
design of ARACHNE. We first provide a description of 
our testbed platform, and subsequently we discuss our 
experiments and their interpretations. We also discuss 
relevant previous work. 

Our wireless testbed deployment (Fig. |2]i consists of 
9 nodes that are based on the ORBIT hardware configu- 
ration, and run a Debian Linux distribution with kernel 
v2.6 over NFS. Each node is equipped with 1 GHz CPU, 
512 Mbytes of memory, and a WN-CM9 wireless card, 
which carries the AR5212 Atheros chipset. We set the 
cards to 802.1 lg mode and we use channels 1, 6 and 
1 1 . For the purposes of these preliminary experiments 
we consider fixed routes between source and destina- 
tion. We inject UDP traffic with various constant bit 
rates (CBR) and with packet size equal to 1500 bytes. 
For each end-to-end traffic session we set different ap- 
plication data rates; we utilize the iperf measurement 
tool. 

We provide a representative experiment in what fol- 
lows. Consider the following two simultaneously ac- 
tive routes (see Fig. [2j>: (a) 16^15^20^19, and (b) 
13— >15— >20— »14. These routes have one link in com- 
mon, 15— >20, while all links are of similar quality in 
terms of achievable throughput in isolation. We apply 
two different channel selection policies; for both poli- 
cies we make sure that connectivity is maintained be- 
tween the end-hosts, i.e., two consecutive relays use the 
same channel on one of their interfaces. We first con- 
sider the channel policy, A 1, which assigns channels ac- 
cording to the interference experienced (allocates the 
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Figure 3: Policy A2 outperforms policy Al, especially 
at high loads. 



channel with the minimum aggregate interference, ob- 
served through RSSI measurements). Subsequently we 
assign channels to nodes, additionally taking into ac- 
count the link loads, in terms of both link quality and 
aggregate traffic service; we call this policy A2. In other 
words, we manually prioritize the channel selection on 
the link 15— >20; the rest of the links choose their chan- 
nels as previously, given the selection on link 15— >20. 
For both approaches, we measure the total end-to-end 
throughput for the two routes. Fig. [3]depicts these mea- 
surements for different source data rates. 

First, we observe that the total end-to-end through- 
put is always higher with A2. Second, we see that the 
throughput improvement with A2 is much higher than 
with Al, when the data rates of the routes are of sim- 
ilar magnitude. This is because A2 assigns a "better" 
channel to the 15— >20 link, which has to serve more 
traffic than the other links. With Al, however, this 
acts as a bottleneck to the performance of the individ- 
ual routes: this bottleneck situation cannot be captured 



by simply measuring the interference at each channel. 
On the other hand, when one of the routes, (say (a)) has 
significantly lower traffic demands, the aggregate traffic 
that traverses link 15— >20 comes mainly from the route 
(b) over a period of time; in such a case, A 1 works quite 
well, too. This observation constitutes a key element for 
the design of ARACHNE, in terms of selecting channels 
and allocating the available interfaces to specific routes, 
as we explain in section [4] 

The LCCS (Least Congested Channel Search) 



method 111 111 was the first effort towards allocating a set 
of available channels to wireless devices. With LCCS, 
devices (e.g. APs) periodically scan the set of avail- 
able channels and select the one with the lowest levels 
of contention (as the name suggests). However, there 
are many topological scenarios where LCCS is unable 
to capture the total interference in the channel, as ex- 
plained in 111 211 . 

Similarly, Leith and Clifford lfl3ll propose a self- 
managed distributed channel selection scheme, wherein 
each AP passively measures the received power from 
the packets transmitted by neighbor APs. 

Kauffmann et al. in J3l propose a distributed fre- 
quency selection algorithm that minimizes the global 
interference in the network. In this work the authors 
minimize the total interference which can result in im- 
proved user throughput. However, their algorithm does 
not consider the number of clients in the network; it as- 
sumes purely downlink saturated traffic and that all APs 
have affiliated clients. 

Similarly, Rozner et al. in lfl4ll propose a chan- 
nel assignment scheme for WLANs, taking into account 
traffic demands. They show that taking into considera- 
tion the current traffic demands at APs and clients, the 
quality of the channel assignment can be greatly im- 
proved. 

The MaxChop mechanism |0] provides high levels 
of fairness among users using channel hopping. How- 
ever, it requires tight synchronization between AP and 
clients, while it is difficult to implement efficiently with 
off-the-shelf hardware. 

In J3l the authors study the joint channel allocation 
and routing problem, assuming that traffic demands and 
network topology are known. They propose a central- 
ized algorithm that maximizes the aggregate through- 
put. 

Raniwala et al. |7[] propose a tree-based mesh archi- 
tecture, Hyacinth, where the local channel load infor- 
mation exchange facilitates the channel selection. Hy- 
acinth tries to address the joint problem of channel as- 
signment and routing in wireless mesh networks. The 
latter two approaches, , however, assume the avail- 
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ability of a global network view. 

Our protocol differentiates from these approaches by 
providing efficient end-to-end channel selection in a dis- 
tributed manner (access level and mesh backhaul). In 
ARACHNE there is no need of synchronized channel 
access. Moreover our protocol does not employ any 
tree-based architecture, which in some cases cannot rep- 
resent the actual network topology and its dynamics. Fi- 
nally, our work is fully compliant with 802. 1 1 s |@] wire- 
less mesh networks and it can be implemented on top of 
the existing IEEE 802.1 1 standards. In what follows, we 
present the metric that our protocol uses during channel 
assignment. 

3. Modelling Channel Allocation 

It is clear that the selfish objective of the APs is to 
select the channel that provides the maximum average 
throughput. Therefore, the APs need to first estimate 
the average throughput it can obtain from when operat- 
ing on a specific channel. In this section, we investigate 
important parameters that need to be taken into account 
for allocating the available channels to the APs in an 
efficient manner, through a sophisticated metric and a 
problem formulation that tries to maximize the through- 
put in the network. 

3.1. Metric for channel selection 

The metric that we consider in our work is called air- 
time metric. The airtime metric was first discussed in 
the 802. lis {9] standard, for the purposes of load-aware 
routing (RM-AODV routing protocol). This metric re- 
flects the load on a wireless router (AP) in terms of the 
average delay a transmission of a unit size packet expe- 
riences. RM-AODV which is the default routing proto- 
col in 802.1 ls-based wireless mesh networks, employs 
the airtime metric in order to provide end-to-end paths 
with the minimum total airtime cost. 

Formally, the airtime cost of a link / that supports 
the communication of two routers, using channel or fre- 
quency /, is given as: 
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In (03, O ca is the channel access overhead, O p is the pro- 
tocol overhead and B, is the number of bits in the test 
frameQ. Some representative values for these constants, 



for 802. 1 lg, are: O ca + O p = l .25ms and B, = 8224bits. 
Furthermore, R/j and e pt are the current transmission 
rate and frame-error rate, respectively, in Mbps, for the 
test frame size B, in channel /. In other words, the esti- 
mation of e pl corresponds to transmissions of standard- 
size frames B, at the current transmit bit rate Rij. 

In order to understand the reasoning behind the air- 
time metric, we should refer to the recent work investi- 
gating the calculation of the average throughput of the 
802.11 based WLANs 10, |0, CI, 10 ■ It has been 
observed that when there are several flows with differ- 
ent physical transmission rates, then the throughput of 
all flows is bounded by the slowest transmission rate 
1 1911 . In order to explain this anomaly, further studies 
have been conducted, and the average uplink throughput 
in a single cell environment is calculated under satura- 
tion and decoupling approximations in H 1 5fl and 111 811 . 
The saturation approximation states that there are al- 
ways packets backlogged on every user. Meanwhile, 
with the decoupling approximation it is assumed that 
when there are « users, the aggregate attempt process of 
(n - 1) nodes is independent of the back-off process of 
any given node. 

We consider the simple case, when all nodes have 
the same back-off parameters, each node is the trans- 
mitter for a single flow, and all packets have lengths are 
equal to L. As derived in [18], the total average network 
throughput 0(B) is given as: 
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(2) 

where B is the attempt rate (probability) in the equilib- 
rium, Ci is the physical transmission rate of node i, To 
is the fixed overhead with packet transmission and T c is 
the fixed overhead for an RTS collision. Due to the ex- 
ponential back-off behavior of the nodes and the decou- 
pling approximation, it can be shown that the attempt 
probability of a node accessing the channel can be de- 
termined in terms of a given collision probability y as: 



G(r) = 



2jk=o y 



(3) 



The transmission of test frames is necessary, in order to derive 
values for the computation of the airtime cost. 



where K is the maximum number of attempts allowed 
under the protocol, and is the mean back-off at the 
k'' 1 attempt. Meanwhile, the probability of collision of 
an attempt by a node is given by F(B) = 1 — (1 — pf~ 1 
due to the decoupling approximation. The equilibrium 
behavior of the system is governed by the solution of 
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Figure 4: The contention and protocol overhead calcu- 
lated according to ©. 



the fixed point equation y = F(G(y)). If this equation 
is solved it yields the collision probability from which 
the attempt rate in the equilibrium f3 can be determined 
from (0. 

Note that is the average delay per packet in the 
equilibrium, which is given as: 



p(n) — = 

1-(1 



p) n-l 
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The first three terms in © represent the delay due to 
channel contention and protocol overheads, and the last 
term represents the average transmission time of an L 
length packet by a node in the cell. 

Comparing the definition of the airtime metric Cij 
and p(n), we can see that the sum of O ca and O p corre- 
spond to the approximation of the first three terms in <j4j 
with a constant value. Clearly, the transmission of RTS, 
CTS, DATA and ACK frames may also get corrupted 
not only due to collisions but also due to channel errors. 
The authors in Il6ll extended the analysis in [15] to ac- 
count for wireless bit errors. The goodput expression 
in lfl6ll assumes that the wireless channel can be mod- 
eled by an appropriate Gilbert model with known tran- 
sition probabilities. In wireless networks with dynam- 
ically changing conditions, such an assumption is not 
practical. Therefore, in our method, the current frame 
error rate, e pt is measured by the users and AP. For each 
packet attempted to be transmitted, it would be in er- 
ror due to channel errors with probability e pt . Clearly, 
the average number of attempts until successful trans- 



mission would be 1/(1 - e£ f ). For each attempt on the 
average p(«) amount of time is experienced. The prod- 
uct of 1 /( 1 - e^ f ) and p(«) gives the airtime metric used 
in our algorithm. 

When the overhead is calculated as described in (0), 
we observe that it varies with the number of nodes in the 
cell. The overhead with respect to the number of nodes 
is given in Figure |4] when K — 7, bo = 16, b\ = 2 k bo, 
the slot length is 20fis, To = 52 and T c = 17 slots. The 
overhead varies between 59 and 66 slots, and thus, it is 
clear that there is weak dependency between the num- 
ber of users n and the observed overhead. Therefore, in 
our work the overhead is taken as a constant to avoid 
additional computational complexity. 

3.2. Optimal channel allocation problem formulation 

In this section we give a general formulation of the 
channel allocation in 802.1 1 -based multi -radio wireless 
networks. We formulate the problem of optimal allocat- 
ing the available channels, assuring that the maximum 
path end-to-end delay (total airtime cost of the links that 
are part of the path) in the network is minimized. Based 
on our previous analysis this will provide throughput 
maximization and balanced/stable network operation is 
assured OOjl: 
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Where P is the set of active paths in the network, F 
is the set of available channels/frequencies that could 
be used by the APs, L p is the links that belong to path 
p, aij is the probability that frequency / is used in the 
communication through link /, and Cij is the airtime 
cost of link / that operates in frequency / . 



4. Dynamic Channel Selection Protocol 

Inspired by the previous problem formulation and the 
airtime metric association with the average through- 
put in the network, we present ARACHNE, a load 
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and routing aware channel selection protocol. The de- 
sign of ARACHNE makes the following assumptions: 
a) Nodes of the mesh backhaul are equipped with at 
least 3 radios (2 for the mesh backhaul communica- 
tion and 1 for the access level), one of which is set 
to a pre-arranged channel, c p , which is the same for 
all nodes in the network, b) A client associates to the 
AP with the strongest signal (RSSI) among all neigh- 
bor APs (802.1 1-based association procedure), c) The 
set of channels used at the access level (AP-client inter- 
faces), is different from the channel-set used at the mesh 
backhaul. For simplicity in this paper, the AP-client in- 
terfaces operate in the 2.4 Ghz band; the 5 GHz band is 
used exclusively for the interfaces in the mesh backhaul, 
and d) We assume that traffic is exchanged among end 
hosts belonging to the same mesh network, i.e., traffic 
is not crossing different networks. 

As previously mentioned, ARACHNE is using air- 
time metric that is an approximation of the per packet 
latency. The airtime metric was first discussed in the 
802.11s J9t] standard, for the purposes of load-aware 
routing (RM-AODV routing protocol). This metric re- 
flects the load on a wireless router (AP) in terms of 
the average delay a transmission of a unit size packet 
experiences. RM-AODV which is the default routing 
protocol in 802. 1 ls-based wireless mesh networks, em- 
ploys the airtime metric in order to provide end-to-end 
paths with the minimum total airtime cost. We adopt 
this metric in ARACHNE for the purposes of our pro- 
posed channel selection functionality. 

Several studies have shown that the number of erro- 
neously received packets increases and the transmission 
rate decreases when the cells in the network interfere 
[21], I22I1 . J3]. The airtime metric takes into account 
the packet error rate as well as the transmission rate; 
hence, it reflects the performance at a particular commu- 
nication channel. Besides, our analysis shows that the 
average airtime cost is an approximation of the average 
per-packet delay. Therefore, the average airtime cost is 
a representative metric that reflects the channel perfor- 
mance and also approximates the maximum throughput 
in the network. 

ARACHNE captures the performance of a bidirec- 
tional link in terms of estimated throughput at a par- 
ticular channel, by measuring the average airtime cost 

for both uplink and downlink, C uf = C" p f + Cf' f wn (air- 
time cost for a particular link / that operates on chan- 
nel /), and applies a channel selection methodology 
where the channel with the minimum Cij is chosen. 
This channel selection policy determines the frequency 
with the minimum average per-packet delay in both up- 



link and downlink, thereby approximating the maxi- 
mum throughput in the communication. The goal of 
ARACHNE is to assign channels to mesh nodes (their 
associated links), such that the average airtime metric 
is minimized, both at the access and the backhaul lev- 
els. Hence, ARACHNE involves two procedures, PI 
and P2, one for each level. ARACHNE is executed ex- 
clusively by the nodes that belong to the mesh backhaul, 
i.e., the relay nodes as well as the APs that connect the 
end-hosts with the mesh network. The channel discov- 
ery is initiated by the AP of the source host. In what 
follows, we describe the operations of the protocol, PI 
and P2, for each of the two levels of operation. 

4.1. Channel Selection at the Access Level (PI) 

The access level involves the communication of the 
end hosts (clients) with mesh nodes at the edge of the 
backhaul (APs). Let us assume that host A wishes to 
send traffic to another host B. This traffic will flow 
through A's AP, Ma, to the mesh backhaul and it will 
eventually reach Mb', the latter will finally forward the 
traffic to B. With ARACHNE, the frequency selection at 
the access level involves a channel discovery process at 
the two aforementioned APs, in order to find the chan- 
nel with the minimum airtime cost value (we provide 
the steps of this process below). The calculation of this 
value is performed for every scanned channel, and in- 
volves all the downward and upward links of Ma, Mb- 
Let us again assume that host A wants to send traffic to 
host B. This part of the protocol, PI, is executed by the 
APs at the access level, and consists of the following 
steps. 

1. [Pla]. Deriving interference information for 
the current channel. At the nominal start of 
ARACHNE, Ma is informed about the operational 
frequencies of its neighbor co-channel APs. This 
can be performed through passive scanning of pe- 
riodically transmitted beacons 1E3I1 . |24]. By the 
end of this step, Ma is aware of the total received 
power from all co-channel APs. 

2. [Plb]. Computing the downlink airtime cost. 
Ma calculates the aggregated downlink airtime 
cost with its clients, through a link performance- 
measurement procedure, described in detail in 
where airtime metric is used for user association. 

3. [Pic]. Computing the uplink airtime cost. The 
clients of Ma calculate their individual uplink 
costs. They piggy-back this information through 
their data frame transmissions towards Ma- By the 
end of this step, Ma has received information re- 
garding the uplink channel qualities from all its 
clients. 
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4. [Pld]. Deciding if the current channel is appro- 
priate. Ma receives the information from the client 
and computes the average airtime cost (for both up- 
link and downlink) for the access level. If this is 
higher than a pre-defined threshold T, Ma remains 
in the same channel; otherwise it initiates a chan- 
nel discovery process. The value of threshold T is 
decided and controlled by the system designer. 

5. [Pie]. Computing the cumulative airtime cost 
at the next available channel. M A and its clients 
switch to the next channel and repeat steps Pla - 
fie. channels have been visited, Ma finally selects 
the channel with the minimum average airtime cost 
(for both uplink and downlink). 

Note here that process PI (of selecting a channel at 
the access level) is actually independent of P2; although 
they are executed in parallel, they do not affect each 
other to a large extent, in terms of convergence time, 
since they utilize different sets of channels. 

4.2. Channel Selection at the Mesh Backhaul (P2) 

The role of the mesh backhaul is to serve the forward- 
ing of packets towards their final destinations. Undoubt- 
edly, the network connectivity is affected by a poten- 
tial channel selection policy that may be applied in the 
mesh backhaul. ARACHNE's channel selection frame- 
work ensures connectivity at all times in the network, 
while the selection of a frequency at a particular link 
takes into account the load of the routing sessions that 
are traversing the link. 

For the purposes of this study, we assume that the 
RM-AODV routing protocol 131 is applied on the mesh 
backhaul. RM-AODV is the default routing proto- 
col proposed in the context of 802.1 Is mesh networks, 
where the airtime cost is used as a routing decision met- 
ric in the mesh backhaul. In particular, the airtime met- 
ric is appended to the RREQ and RREP messages, dur- 
ing the route discovery process; finally, the end-to-end 
path with the minimum total airtime cost (the route that 
has the minimum load) is selected. Our choice of RM- 
AODV is motivated by the fact that this routing proto- 
col is based on the airtime metric. Note, however, that 
ARACHNE can operate in conjunction with any load- 
aware routing protocol! 

ARACHNE employs the Local Association Base Ad- 
vertisement (LABA) mechanism introduced in 802.11s 
Jgt], in order to disseminate information with regards to 
inform the entire mesh network about the clients (end- 
hosts) that are associated with each mesh AP (MAP). 
MAPs periodically broadcast LABA frames, which con- 
sist of the MAC addresses of the hosts that are associ- 



ated with. We have enhanced the LABA frames to in- 
clude load related information, as we explain later in the 
description of the main parts of our protocol. 

Our protocol prioritizes the channel assignment on 
the most loaded mesh APs, as well as on the mesh APs 
that are expected to be highly loaded in the near future. 
In particular, ARACHNE observes the per-link load, for 
the links that serve one or more routes in the mesh; the 
load is captured in the airtime cost metric. The main 
problem in designing distributed channel selection poli- 
cies is the channel dependencies that arise between the 
nodes that are part of the mesh network. For example, 
we assume that in the simple network in Fig. Q]nodes are 
equipped with 2 wireless interfaces. Node D uses chan- 
nel a in order to communicate with node E and node E 
uses channel b to communicate with nodes F and G. In 
case that in the link E-F the current channel is heavily 
loaded, node E uses a new channel c that operates better 
at the link E-F. As E has only two interfaces, channel 
c must be used at the link E-G too. Unfortunately, a 
strong dependency between links E-F and E-G is estab- 
lished. Channel c may not operate effectively at the link 
E-G and therefore the performance of the network is de- 
creased. The aforementioned ripple effect could be fur- 
ther propagated in dense/huge mesh networks. In order 
to avoid ripple effects in the channel selection process 
|0], we categorize the wireless interfaces at each mesh 
AP that serves as a relay as: 

• IN network interfaces that are used for data recep- 
tion, 

• OUT network interfaces that are used for data for- 
warding. 

In addition, each node can assign channels only to its 
OUT interfaces. The IN interfaces follow the channels 
that are assigned by the nodes that communicate with 
the current node. In our protocol we assume that each 
node is equipped with at least one IN and one OUT in- 
terface. 

In what follows, we describe the steps that are ex- 
ecuted by ARACHNE, for assigning channels at the 
mesh backhaul (P2). To begin with, we consider a net- 
work state, in which: (a) All nodes have already dedi- 
cated one of their OUT interfaces (I p ), to control chan- 
nel c p \ (b) The RM-AODV protocol has converged to 
a set of routes between end-hosts; (c) All of the other 
wireless interfaces of a mesh AP (besides I p ) have been 
randomly assigned a channel in the 5 GHz band. The 
channel assignment in P2 is comprised of the following 
steps. 
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1. [P2a]. Constructing a priority list. With 
ARACHNE, an AP starts the channel-scanning 
process at a time-instant dictated by its priority 
ranking. This priority is highly-related to the load 
of each mesh AP; the higher the load (the data that 
must forward), the higher the priority. In addition, 
we believe that in this prioritization the estimated 
load of a mesh AP in near future must be taken into 
account. In this way we guarantee that our protocol 
will converge to a long-lasting and a stable channel 
allocation. Hence, it is imperative that this list is 
constructed, before APs start scanning each chan- 
nel. Each mesh AP a calculates its priority rank 
according to its current load and its expected load, 
as: 

P a r =L a crm w x +L a est w 2 , 

where L a crnt is the current load, L" s( is the estimated 
load and wi, W2 are the weights that are used in the 
calculation (we will give more details about the se- 
lection of wi, W2 in the evaluation of our protocol). 
As far as the calculation of the estimated load 
L a est is concerned, we adopt the estimation method 
(based on historical data), proposed in J25ll . In this 
approach the authors design a trace-based traffic 
model in order to predict the aggregated traffic de- 
mands of an AP in near future. Time-series analy- 
sis is the basis of this traffic estimation model. The 
accuracy of this model is high and in combination 
with our load-aware channel selection protocol we 
achieve long-lasting and stable channel allocation 
in the mesh network. 

Besides, the priority ranks are incorporated into the 
LABA frames and are broadcasted in the network. 
Consequently, a sorted list (in terms of channel 
selection priority rank) is disseminated and main- 
tained by all APs. Hence, by the end of this step 
each AP knows the priority of all APs in the net- 
work. 

2. [P2b]. Performing channel-scanning. Let us as- 
sume that each AP can scan a set of K channels. 
The first mesh AP in the priority list measures the 
cumulative airtime cost for each of the K channels, 
and constructs a local, temporary, sorted list with 
the cumulative airtime cost, per channel. 

3. [P2c]. Assigning route sessions to available OUT 
interfaces. Each mesh AP typically prefers to as- 
sign low-airtime channels to links that serve high- 
load end-to-end traffic sessions. ARACHNE man- 
ages to efficiently utilize the available spectrum, by 
providing a balanced channel and interface assign- 
ment in the network. A mesh AP a calculates the 



maximum load L a , that can be assigned at each of 
its OUT interfaces, as (clearly, when the number 
of available interfaces is higher than the number of 
served sessions, ARACHNE allocates an interface 
to a particular session): L a h = L" crnt I N a 0UT , where 
Nq VT is the number of the available OUT inter- 
faces of mesh AP a, and L" rnt is the current load 
that must be served by the OUT interfaces. The 
main constrains of the interface-assignment strat- 
egy are: (a) The load assigned to an OUT interface 
is less than L a , , and b) The load is proportionally 
allocated to the available OUT interfaces in terms 
of the load of the flows that pass through the cur- 
rent mesh AP. In other words, the load is balanced 
to the available OUT interfaces. 

4. [P2d]. Assigning channels to OUT interfaces. In 

order to assign a channel to an OUT interface, a 
coordination with the IN interface of the mesh AP 
that receives the traffic is required. In particular, 
a mesh AP A that selected the channel with the 
minimum airtime cost in the communication with 
a mesh AP B, sends an RTC frame to B announc- 
ing in this way that a new channel must be used 
in their communication. Then, B responds with a 
CTC frame and sets one of its IN interfaces to the 
selected channel. In case that B hasn't responded 
in a time window (timeoff), A retransmits its RTC 
frame. We must note here that there are special 
situations where B has recently assigned channels 
to its IN interfaces (during the same protocol iter- 
ation) and there are no available interfaces to as- 
sign the channel proposed by A. In other words, 
higher priority APs (compared to the priority of 
A) have previously assigned those channels to Bs' 
IN interfaces and there are no available IN inter- 
faces to use. In this case B responds with an XTC 
frame announcing this situation to A and the avail- 
able channels that are assigned to its IN interfaces. 
Lastly, A is restricted to use one of those channels 
in the communication with B (the channel with the 
minimum airtime cost in the link A-B). After the 
completion of the aforementioned process, the AP 
sends a "good-to-go" unicasQ message to the next 
AP in the list, to initiate its channel scanning and 
assignment process. 

5. [P2e]. Selecting channels iterative ly. When 
all APs have completed a round of channel scan- 
ning, RM-AODV updates the routing tables at the 



2 This unicast message speeds up the process of completing a full 
iteration of channel scanning for all participating mesh APs. 
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mesh backhaul, taking into account the new state 
of the network. Note that RM-AODV discovers 
routes based on the computation of the airtime 
cost, which will have likely changed after an itera- 
tion of the channel selection process. Steps P2a- 
P2e are repeated until the channel selection has 
converged. 

5. Evaluating ARACHNE 

In this section we present the evaluation study of our 
approach. During the first part of our validation method- 
ology we use the optimal solutions (resulted from so- 
lution of the channel allocation problem described in 
section [3]) as a benchmark in the direction of evaluat- 
ing ARACHNE. We are using IBM ILOG CPLEX Op- 
timizer l26ll to find the optimal solutions. Moreover, 
we evaluate ARACHNE through extensive simulations, 
which import both synthetic 11011 and real traces H27I1 . 
1 28[] . We compare our protocol against other channel se- 
lection schemes, and we present ARACHNE's predom- 
inance in terms of the total network throughput, average 
packet dropping and average transmission delay. 

We have implemented ARACHNE in OPNET 
We have also implemented the RM-AODV protocol and 
we consider this routing protocol in our simulations. 
The clients are uniformly distributed (at random) in the 
lOOOmx 1000m simulation area. All nodes use a default 
transmit power of 20 dBm and the source-destination 
pairs are randomly chosen in the network. We exper- 
iment with: (a) fully-saturated, end-to-end UDP traf- 
fic, (b) VoIP traffic and (c) real traffic traces from Dart- 
mouth College J23l and IBM J2I. The weight val- 
ues that are used in our simulation experiments are: 
w\ = 0.6 and wo = 0.4 (the current load affects the chan- 
nel selection procedure more than the estimated load in 
near future). As far as the duration of the measurement 
period is concerned, in our experiments the APs remain 
100ms in each scanned channel in order to gather the 
necessary information. The convergence of our mecha- 
nisms is reached rather quickly in our network topolo- 
gies, as will be presented later in the current section 
(close to 20sec in average and after a small number of 
iterations). Moreover, the traffic keeps flowing during 
the execution of ARACHNE and the network opera- 
tion is not affected. We chose accordingly the values of 
the performance threshold T in order to avoid unneces- 
sary and frequent protocol executions. Throughout our 
simulation experiments we compare the network perfor- 
mance with ARACHNE, against the single-channel as- 
signment, a random-channel allocation strategy, as well 
as the Hyacinth protocol J3]. 



Table 1: Performance with UDP traffic and 12 orthogo- 
nal channels available 



Number 


Optimal 


Throughput 


Average 


of APs 


Throughput 


achieved by 


ARACHNE 




(Mbps) 


ARACHNE (Mbps) 


iterations 


10 


48.2 


45.1 


3 


20 


79.4 


76.4 


5 


30 


131.2 


121.3 


8 


40 


164.8 


151.1 


11 


50 


208.6 


192.7 


15 


60 


249.1 


230.2 


19 



Table 2: Performance with UDP traffic and 3 orthogonal 
channels available 



Number 
of APs 


Optimal 
Throughput 

(Mbps) 


Throughput 
achieved by 
ARACHNE (Mbps) 


Average 
ARACHNE 
iterations 


10 


42.7 


38.4 


2 


20 


64.8 


59.2 


3 


30 


112.2 


102.2 


6 


40 


139.1 


121.7 


9 


50 


194.2 


172.4 


11 


60 


227.5 


211.8 


15 



5.7. Simulations with saturated UDP traffic and VoIP 

To begin with, we opt to observe the variation in 
the total network throughput, as we increase the num- 
ber of source-destination pairs. For this, we progres- 
sively increase the number of associated clients from 
5 to 70. Here the network consists of 10 mesh APs, 
each of which is equipped with 2 wireless interfaces for 
the backhaul communication. We observe in Fig. |5(a)| 
that the performance with ARACHNE is similar to the 
other 3 policies, when the number of clients (and there- 
fore the communication load) is low. However, with 
increased load ARACHNE manages to provide much 
higher end-to-end throughputs (up to 85% difference!), 
due to its efficient end-to-end channel selection strat- 
egy. Hyacinth is designed especially for wireless inter- 
net traffic which is directed to/from the wireless gate- 
ways. In our simulation scenario where saturated UDP 
traffic is supported, the tree-based architecture is inca- 
pable to support dynamic traffic variations in the net- 
work (especially in high communication load). In low 
load conditions Hyacinth performs close to ARACHNE. 
Contrarily, when the number of clients in the networks 
increases the performance drops and the throughput sat- 
uration point is reached quite quickly. 
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5 10 20 30 40 50 60 70 

Number of stations 

(a) Total throughput Vs. # clients. 




10 20 30 40 50 60 70 80 

Number of APs 

(b) Total throughput Vs. # APs. 



Figure 5: ARACHNE povides very high total end-to- 
end throughput with saturated UDP traffic. 



Next, we examine how close ARACNE is performing 
to optimal and its' scalability. We measure the total end- 
to-end network throughput as we increase the number of 
APs (i.e., the mesh density) from 10 to 80 (we increase 
the number of clients along with the mesh APs: 10 APs 
- 40 clients, 20 APs - 80 clients, etc.). Note that the 
interference is dynamically changing while the number 
of the APs in the network increases; the airtime cost 
metric manages to effectively capture the varying co- 
channel interference. 

The first objective in our study is to examine how ef- 
ficient ARACHNE approaches the optimal network per- 
formance. Our methodology includes the incorporation 
of the operational parameters in the IBM ILOG CPLEX 
solver while the number of the existing APs in the net- 
work increases. The optimal channel allocation then is 
applied in our simulation scenarios (OPNET). We apply 
ARACHNE under the same operational parameters of 
the network and compare the achieved throughput to the 
optimal throughput. In table[TJ we observe the total net- 



Table 3: Performance with VoIP traffic and 12 orthogo- 
nal channels available 



Number 
of VoIP 
sessions 


Optimal 
end-to-end 

delay (s) 


End-to-end delay 
achieved by 
ARACHNE (s) 


Average 
ARACHNE 
iterations 


4 


0.0001 


0.0002 


2 


8 


0.0008 


0.001 


4 


12 


0.001 


0.004 


7 


16 


0.006 


0.009 


11 


20 


0.011 


0.016 


14 


24 


0.014 


0.019 


18 



Table 4: Performance with VoIP traffic and 3 orthogonal 
channels available 



Number 
of VoIP 
sessions 


Optimal 
end-to-end 
delay (s) 


End-to-end delay 
achieved by 
ARACHNE (s) 


Average 
ARACHNE 
iterations 


4 


0.0009 


0.0012 


2 


8 


0.0014 


0.0028 


3 


12 


0.0044 


0.0069 


4 


16 


0.0092 


0.014 


8 


20 


0.016 


0.02 


9 


24 


0.019 


0.024 


12 



work throughput that is achieved by ARACHNE after 
a small number of iterations till convergence, compared 
to the optimal network throughput (when 12 orthogo- 
nal channels are available). Moreover, table [2] contains 
similar results when there are just 3 available orthog- 
onal channels availabe to allocate in the network. A 
general outcome is that ARACHNE is able to approach 
the optimal network performance and scales efficiently 
while the network topology and the load vary. The best 
performance is achieved within the case of 12 orthogo- 
nal channels, where ARACHNE gets as input more fre- 
quencies to allocate and converges to channel allocation 
solutions close to optimal (by applying its' sophisticated 
policy). 

Fig. |5(b)| depicts the network performance in terms of 
total network throughput. We observe that ARACHNE 
is able to adapt to topology and load variations, and 
therefore it manages to provide the most beneficial se- 
lection of the available channels at the mesh backhaul. 
The tree-based architecture that is introduced by Hy- 
acinth does not scale well and therefore the achieved 
performance is worst. 

In order to observe the performance of our protocol 
with delay-sensitive data exchange, we utilize varying, 
parallel, end-to-end VoIP traffic sessions. Tables [3] and 
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Figure 6: VoIP simulation results. 



IDpresent the average end-to-end delay that ARACHNE 
achieves (when 3 and 12 orthogonal channels are sup- 
ported in the network). The optimal end-to-end delay 
values are used (again) as a benchmark. We observe 
that ARACHNE can perform close to optimal after a 
lightweight execution in the network (i.e. converges af- 
ter a small number of iterations). 

Fig. |6(a)| depicts the average end-to-end de- 
lay of VoIP packet transmissions. We observe that 
ARACHNE achieves low end-to-end delays, since it 
considers the load of the individual links across a route 
in the process of assigning frequencies to wireless inter- 
faces. Recall here that ARACHNE is expected to pro- 
vide high benefits when operating in conjunction with 
load-aware routing protocols, such as RM-AODV (this 
is case with our simulations), since they both take into 
consideration the load that is experienced by individual 
links. Furthermore, 

Fig. |6(b)| shows the average number of dropped data 
packets due to channel errors and contention. The per- 
formance of ARACHNE is impressive, since packet 
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Figure 7: Simulation results with real traces: 2 inter- 
faces per node. 



dropping is kept in very low levels, as compared to 
Hyacinth and to the other strategies. Hyacinth does 
not consider the channel allocation at the access level 
and therefore, it can support less VoIP sessions than 
ARACHNE can (which provides end-to-end channel 
selection and therefore the provided QoS to the VoIP 
clients is higher). 

5.2. Simulation results with real traces 

Next, we present our simulation experiments with 
real traces from Dartmouth College and IBM. This will 
reveal the behavior of ARACHNE in more realistic sce- 
narios. By analyzing the SNMP logs from each AP, we 
derive the dynamic behavior of the aggregated traffic de- 
mand. Note that these traces have been gathered from 
real wireless networks settings. Therefore, they repre- 
sent actual traffic demands and behaviors of the APs and 
clients in a real network. The traces from Dartmouth 



College 12711 (3/2001) involve the buildings labeled as 
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Figure 8: Simulation results with real traces: 4 inter- 
faces per node. 



Figure 9: Simulation results with real traces: Varying 
the number or the supported interfaces and the available 
channels in the network with Dartmouth traces. 



AcadBldglO, SocBldg4, and include approximately 40 
APs (the traces contain the AP location; we have placed 
them accordingly in our simulation space). In addi- 
tion, the traces from IBM 111] (8/2002) have been gath- 
ered from the buildings labeled as MBldg, and SBldg. 
The number of the APs that are placed in these build- 
ings varies (the traces don't contain their locations and 
therefore we have simulated various random topologies 
with them). Note that the number of active clients (end- 
hosts) varies in both trace sets and they are uniformly 
distributed in our experiments (the traces do not contain 
information about client locations). In order to import 
the traces in our simulation we periodically compute the 
average rate for each client, once every 3 minutes, and 
we correspond the computed average values to the traf- 
fic demands of each client. 

To begin with, we consider 2 wireless interfaces at 
each AP for the backhaul communication. Fig. |7(a)| 
and |7(b)| show that ARACHNE achieves the highest 
total network throughput in all cases. Our simulation 



experiments reveal that channel selection at the access 
level plays an important role in the overall end-to-end 
performance. Being based on the airtime cost metric, 
ARACHNE takes into account the channel conditions 
as well as the communication load in both access level 
and the mesh backhaul (not considered by the other ap- 
proaches, like Hyacinth) it manages to boost the net- 
work throughput. Furthermore, Fig. |8(a)| and |8(b)| de- 
pict the performance of the compared strategies when 
the mesh APs are equipped with 4 wireless interfaces. 
The results are similar as above. 

Last but not least, we perform some simulations 
where we vary both the number of available channels 
and the number of the wireless interface supported by 
each AP. Fig. [9(a)] [9(b)] [T0(a)[ and [T0(b)[ show that 
ARACHNE achieves higher throughput values com- 
pared to Hyacinth QV|] . Throughput improvement is sig- 
nificantly increasing while we the number of available 
channels and the wireless interfaces increase. 
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Figure 10: Simulation results with real traces: Varying 
the number or the supported interfaces and the available 
channels in the network with IBM traces. 



6. Conclusions and Future Work 

In this paper we presented a channel allocation 
methodology with main objective to maximize the end- 
to-end throughput in wireless multi-radio mesh net- 
works. The modeling of the channel allocation prob- 
lem inspired the design of a routing-aware channel se- 
lection mechanism (ARACHNE). ARACHNE adopts a 
metric that provides an estimation of the average packet 
transmission delay in a communication channel. In ad- 
dition, ARACHNE captures the interference effects in 
the networks and applies an end-to-end channel selec- 
tion policy that manages to provide approximately the 
maximum end-to-end network throughput. Our work 
was compared against 3 other representative channel se- 
lection approaches and we showed that it outperforms 
all of them, for different traffic scenarios and network 
densities. Moreover, the simulation results showed that 
ARACHNE converges after a small number of itera- 



tions, proving in this way its' lightweight operation. 
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